Ultravox v0.5 Llama 3.2 1B ONNX
MIT
Ultravox is a multilingual audio-to-text model built on the Llama 3.2 1B architecture. It supports speech recognition and transcription in multiple languages.
Audio-to-Text
Transformers, Supports Multiple Languages